SUBMITTED TO IEEE TRANSACTIONS ON COMMUNICATIONS 



1 



Controlling the Error Floor in LDPC Decoding 

Shuai Zhang and Christian Schlegel, Fellow, IEEE 



(N 

o 

(N 
O 

O 



C/3 



> 

in 
o 

ON 

o 

(N 

> 
• i-H 

X 



Abstract — The error floor of LDPC is revisited as an effect of 
dynamic message behavior in the so-called absorption sets of the 
code. It is shown that if the signal growth in the absorption sets 
is properly balanced by the growth of set-external messages, the 
error floor can be lowered to essentially arbitrarily low levels. 
Importance sampling techniques are discussed and used to verify 
the analysis, as well as to discuss the impact of iterations and 
message quantization on the code performance in the ultra-low 
BER (error floor) regime. 

Index Terms — absorption set, error floor, importance sampling, 
iterative decoding, low-density parity-check codes, trapping set. 



I. Introduction 

LOW-density parity-check (LDPC) codes are a class of lin- 
ear block codes |1| which have enjoyed intense theoret- 
ical and practical attention due to their excellent performance 
and decoding efficiency. Their parity-check matrices are given 
by sparse binary (0, 1) matrices which enable very efficient 
0(n) -complexity decoder implementations, where n is the 
code length. The error rate of LDPC codes decreases rapidly as 
the signal-to-noise ratio (SNR) increases, and comes very close 
to maximum-likelihood (ML) decoding error performance, 
which is determined by the distance spectrum of the code. 

However, when utilizing sub-optimal 0(n) iterative de- 
coding algorithm, such as message passing (MP) or linear 
programming, a marked increase of error rates at high SNRs 
tends to appear with respect to optimal decoding. The error 
curve assumes the shape of an error floor with a very slow 
decrease with SNR. This error floor is caused by inherent 
structural weaknesses in the code's interconnect network, 
which cause long latencies in the iterative decoder or outright 
lock-up in error patterns associated with these weaknesses. 
These decoding failures are very important for low-error-rate 
applications such as cable modems and optical transmission 
systems. They were initially studied in Q-Q, and called 
trapping sets in [5]. Such trapping sets are dependent not only 
on the code but also on the channel where the code is used, 
as well as the specific decoding algorithm. 

For example, trapping sets on the binary erasure channel are 
known as stopping sets [6], whereas the dominant trapping sets 
of LDPC codes on the Gaussian channel are the absorption 
sets [7'j. Absorption sets are also the sets implicated in the 
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failure mechanism of message-passing decoder for binary 
symmetric channels. 

Due to the importance of error floors in low-BER applica- 
tions, recent efforts have focused on understanding the dynam- 
ics of absorption sets [5|, [8|-|15|, and several modifications 
of the decoding algorithm have been studied to lower the error 
floor, specifically targeting the absorption sets fl2), [13], |16|, 



(T7J. As we discuss in this paper, the onset of the error floor 
on Gaussian channels is very strongly related to the behavior 
of the algorithm on binary symmetric channels, and that the 
dynamics of the absorption sets fully explain why and when 
the error floor becomes dominant in a given code. 

In our earlier work J9j, [ fTT| , p3) we developed a linear 
model to analyze dynamics of and probabilities for the error 
floor for specific codes. We identified and enumerated the 
minimal absorption sets, which dominate the decoding perfor- 
mance in the error floor region for the IEEE 802. 3an and the 
Tanner [155, 64, 20] codes, and derived closed-form formulas 
that approximate the probability that an absorption set falls in 
error. 

In this paper, we first refine the formula for the error floor 
probability [9], fTT) , fT3) , and use it to show that the error 
floor critically depends on the limits, aka thresholds, which 
are utilized to represent the internal log-likelihood messages 
in the decoder. We show that the growth rate of the error 
patterns in the absorption set can be balanced by the growth 
of the LLRs external to the set, if a sufficient dynamic range 
for these messages is available. We quantify this effect and 
compare it to importance sampling guided simulations for 
verification. We verify the hypothesis that, quite unlike in 
the case of binary erasure or binary symmetric channels, 
the error floor phenomenon for LDPC codes on AWGN 
channels is essentially an "artifact" of inexact decoding, and 
not an inherent weakness of the code. We then quantify 
the effect of the number of iterations and show agreement 
of our refined equations with simulated error floor rates for 
both finite iterations and thresholds. Finally, we examine the 
impact of message quantization on the error floor and thus the 
required computational complexity to attain a given level of 
performance in the ultra-low BER regime of the code. 

We wish to note that while sensitivity of the error floor 
to message thresholding has been observed heuristically by a 
number of authors, the view presented in this paper appears to 



have been developed independently by Butler and Siegel [14| 
as well as the authors |9j, p3) . 

This paper is organized as follows: In Section |IlJ LDPC 
codes, iterative decoding, and absorption sets are reviewed, 
and the two example LDPC codes used in our studies are 
introduced. In Section III our linear algebraic approach to 



evaluate the error rate is described as a two-step procedure. 
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Fig. 1. The parity-check matrix of the Tanner [155,64,20] regular (3,5) 
LDPC code. 



The first step is to identify the dominant error patterns and 
the second is performing the analysis targeting these error 
patterns. Based on the insights provided by the analytical 



approach, Section IV studies different decoder settings for both 
the length-155 Tanner and the IEEE 802. 3an LDPC codes. 
The computational complexity and importance sampling are 
discussed in Section [V] Finally, the conclusion will be given 
in Section ED 

II. Background 

In this section, we first review the main features of LDPC 
codes and the most popular iterative decoding algorithms. Two 
LDPC codes will be introduced, as well. Then we apply the 
linear analysis technique of (9) to study their error floors in 
the following sections. 

A. LDPC Codes 

An LDPC code is associated with a sparse parity-check 
matrix, denoted by H mXn , where every column represents a 
code bit, and every row constitutes a parity-check equation. 
The code length is n, and the number of information bits 
k > n — m, since H is not necessarily full rank. If every 
column and every row of H has the same number of non-zero 
elements, then we have a "regular" LDPC code, otherwise the 
code is irregular. A regular LDPC code can also be represented 
by (d v ,d c ), where d v and d c are the Hamming weights of 
each column and each row, respectively, together with an 
interleaves Fig. [T] shows the parity-check matrix of a (3, 5) 
regular LDPC code, where non-zero elements are plotted as 
solid dots and zeros are left blank. 

Following Tanner, an LDPC code can be represented by a 
bipartite graph, called a Tanner graph. Let check nodes denote 
the rows of the parity-check matrix, and variable nodes denote 
the columns of the matrix. Then every non-zero entry in H 
indicates a connection between these two disjoint sets. 



1) Tanner [155, 64, 20] Regular (3, 5) LDPC Code: In (jTS | 
and 1 19 1, Tanner introduced a class of regular LDPC codes 
composed of blocks of circulant permutation matrices. The 
parity-check matrix H of these codes consists of a d v x d c 
array of such circulant permutation matrices where each 



denotes a pxp identity matrix with its rows shifted cyclically 
to the left by k positions. 
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v pxd c p 



Immediately, we can tell that each row of ([TJ has exactly d c , 
and each column d v non-zero elements, making the code a 
regular (d v , d c ) LDPC code. Its length is d c p and the number 
of check nodes is d v p. Its "design" code rate is 1 — d v /d c , 
however the actual rate R will be slightly higher in that within 
every row block, all p rows add to an all-one vector and at 
least d v — 1 rows are linearly dependent. 

The dimension of the base identity matrix p was originally 
designed to be prime to eliminate 4-cycles. In fl9) , p was 
extended to non-primes, where the girth, the shortest cycle in 
the Tanner graph and denoted by g, can be as short as 4. It 
can be shown that g is upper bounded by 12, no matter how 
large n is fl8) , |20|. The minimum distance of a Tanner code 
< (d v + 1)1 



is bounded by d m i n < (d v + 1)! [21 J. When d v is small, this 
upper bound can be met by carefully selecting the parameters 



1 19], 121). Tanner codes come with relatively large girth and/or 
minimum distance, and provide good error floor performance. 
However, low-weight absorption sets are not eliminated by this 
construction. 

Tanner presented a [155,64,20], (3,5) regular LDPC code 
in the "Recent Results" session of the IEEE International 
Symposium on Information Theory (ISIT) in 2000. Its block 
structure parity-check matrix is given by 
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(2) 



93x155 



where each I x is derived by shifting the rows of a 31 x 31 
identity matrix cyclically to the left by x positions. Its binary 
structure is shown in Fig. [T] This Tanner code has rate R sw 
0.4129 and is equipped with a large girth g — 8 and d m [ n = 20 
1 18]. We will use this code as one of the examples to illustrate 
our error floor analysis. 

2) IEEE 802.3an [2048, 1723] Regular (6, 32) LDPC Code: 
The IEEE 802. 3an RS-based low-density parity-check code 
belongs to a special class of binary linear block codes, con- 
structed in p2| , p3|. Its parity-check matrix is also comprised 
of blocks of permutation matrices: 



H 



01,1 Cl,2 
02,1 02.2 



06,1 06,2 



0X32 
02,32 

06,32 



(3) 



384x2048 

where each <Tj j is a 64 x 64 permutation matrix. 

The design code rate is 1 — d v /d c = 0.8125, whereas the 
actual code rate is slightly higher with R = 1723/2048 ps 
0.8413. The Tanner graph representing this code is 4-cycle 
free and the minimal cycle length is g = 6. We believe the 
minimum distance of this code is 14 (it is either 12 or 14 

ED). 
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B. Message-Passing Iterative Decoding Algorithms 



TABLE I 

First few absorption sets of Tanner [155, 64, 20], (3, 5) code. 



Although ML decoding is optimal, it is too complex to im- 
plement. On the other hand, 0(rt)-iterative decoding performs 
extremely well. The idea of iterative decoding is that each 
check node combines all the information sent to it and returns 
a likelihood/suggestion back to the variable nodes connected 
to it. The variable nodes can either make a decision of the 
incoming messages, for example, a majority vote, or combine 
them up and send them back out to the check nodes for another 
iteration cycle. The process will stop when all check nodes are 
satisfied, i.e., a valid codeword is encountered, or the decoder 
reaches a maximum number of iterations allowed. 

Decoding is monotonic in that the more iterations the 
decoder executes, the better the performance. This is strictly 
true for cycle-free codes. When the codes have cycles, the 
messages passing along the edges become dependent after a 
few iterations, which degrades the final decoding performance. 
Therefore, constructing a code with large girth is one way 
to guarantee good performance. In this paper a standard log- 
likelihood message-passing (MP) iterative decoding algorithm 
will be used as benchmark. 



C. LDPC Absorption Sets 

The Gaussian channel differs from the binary symmetric 
channel and the binary erasure channel in that the dynamics of 
the error behavior of the LDPC decoder is more complicated. 
Richardson observed that the failure mechanism in the error 
floor of Gaussian channels was caused by what he called 
trapping sets |5|. But fully classifying trapping sets is a largely 
unsolved, and perhaps unsolvable, problem. Nonetheless, sub- 
sequent investigations noted that a special class of trapping 
sets, called absorption sets, dominates the error floor region 
||7), iJTOj. Furthermore, minimal absorption sets (see later) play 
a critical role in the error floor. 

Definition 1. An absorption set is a set of variable nodes such 
that the majority of the neighboring (connected) check nodes 
of each variable node in the set are connected to the set an 
even number of times. 

Note 1. The importance of absorption sets can easily be 
appreciated by realizing that Gallager's original bit flipping 
decoding algorithm |1| will fail to correct an error pattern 
that falls onto an absorption set, and that such an error will 
therefore persist, even if the iteration count goes to infinity. 

Let an ordered pair (a, b) denote an absorption set where a 
is size of the set and b is the extrinsic message degree, i.e., the 
cardinality of the set (no repetition allowed) of the neighboring 
check nodes that are connected to the set an odd number 
of times (usually once). The belief that "smaller" absorption 
sets causes more severe effects on the error floor (5), |(24j 
is supported by the analytical error floor equations derived 
in |9), flTTl . This is analogous to the fact that lower weight 
codewords have more impact on the error rate than higher 
weight codewords. 
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III. Error Floor Estimation 

Our linear algebraic estimation process proceeds in two 
steps. First, the dominant absorption set topologies along with 
their multiplicities have to be identified for a given code. These 
are required to compute the dynamics of the sets in step two. 



A. Absorption Set Identification 

There exist several papers on absorption set enumeration 
which make use of the topological features of the sets as 
well as algebraic properties used in the construction of the 
codes, j9j-pT[, p5] . The search for absorption sets can be 
either algebraic or simulation-based, where the latter typically 
produces lower bounds on the multiplicities and set varieties. 
We have modified existing techniques and integrated them 
into our search method, in order to exhaustively enumerate 
the topologies and multiplicities in a systematic fashion for 
the more dominant sets up to a certain size, limited by 
computational complexity. 

In the following, we list topological properties of the ab- 
sorption sets of the Tanner [155, 64, 20] and the IEEE802.3an 
[2048, 1723] LDPC codes, respectively. 

1) Tanner [155, 64, 20], (3, 5) LDPC Code: Following the 
enumerating methods in Q, fTT) , we list the first few ab- 
sorption sets of the Tanner [155, 64, 20], (3, 5) LDPC code in 
TABLE I (13). 

Although the (8, 2) set is not the smallest absorption set in 
terms of a, it does have a small number of unsatisfied check 
nodes, which makes it dominant. Fig. [2] shows the subgraph 
induced by the (8, 2) absorption set. 

We observe from Fig. 2(b) that the (8, 2) absorption set 



consists of cycles of length 8, 10, 12, 14 and 16. In addition, 
it is also an extension set of lower weight absorption sets. As 
a matter of fact, all (4,4), (5,3), (6,4) and (7,3) sets are 
contained in (8,2) sets. And 50% of (5,5), 4.1% of (6,6), 
8.6% of (7, 5) sets are contained in (8, 2) sets, respectively. We 
found that this (8, 2) set is the dominant absorption/trapping 
set of this Tanner code on the Gaussian channel under iterative 
MP decoding. 
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TABLE II 

THE FIRST FEW ABSORPTION SETS OF THE IEEE 802. 3AN [2048, 1723], 

(6,32) LDPC CODE. 



(a) Check nodes shown 






(b) Check nodes hidden 



Fig. 2. The topology of the (8, 2) absorption set of the Tanner [155, 64, 20] 
regular (3, 5) LDPC code (not all check node connections shown). 




(b) Check nodes hidden 



Fig. 3. The topology of the dominant (8, 8) absorption set of the IEEE 
802. 3an code (not all check node connections shown). 



Actually, Fig. |2(a)| looks very much like a codeword in that, 
if all those eight variable nodes are erroneous and the rest of 
the variables in the code are correct, then all but two check 
nodes are unsatisfied. 

2) IEEE 802.3an [2048,1723] LDPC Code: Absorption 
sets up to size a = 10 are listed in Table |H| fTT) . 

The subgraph induced by the dominant (8, 8) absorption set 
is depicted in Fig. [3] It can be seen that the topology is highly 
symmetric and also looks like a codeword. From Fig. 3(a) one 
sees that only one-sixth of the neighboring check nodes will 
be unsatisfied when the set is in error. In addition, the graph 
shown in Fig. |3(b)| is also full of cycles of several lengths, 
similar to Fig. |2(b)| 

The authors of [22] have shown that the minimum distance 
of this class of codes is lower bounded by d m i n > d v + 1 
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and has to be even. So we have cL 



> 



for this IEEE 



802. 3an LDPC code. As a corollary of our absorption set 
enumeration, we tightened the lower bound to d m j n > 10, 
because no (a, 0) absorption sets exist for a < 10 as seen 
in Table In] (TTJ. In (25} > it is shown that there are at least 
1, 407 weight- 14 codewords, found by an absorption set search 
algorithm. Therefore, the minimum distance is narrowed down 
to d min = 12 or 140 

B. Linear Algebraic Estimation of the Error Rate 

With the topologies and the multiplicities of the dominant 
absorption sets of both codes in hand, we proceed to approx- 
imate their contribution to the error rate. 

1) Tanner [155, 64, 20], (3, 5) LDPC Code: We apply the 
error formulation from |9), fTT) to this Tanner code using the 
(8, 2) absorption set structure. 

'Evidence suggests that d mln = 14. However our search program for 
(12, 0) absorption sets did not complete due to excessive time requirements. 
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Ai, Ai, Ai, A2, A2, A2, A3, A3, A3, A4, A4, A5, A5, A6, A6, Ag, A7, A7, A7, Ag, As, As 

node © 



node © 



node © 



node node f 



node © 



,(ex) _ 



0, 0, 0,0,0, 0, 0, 0, 0, A^ x) , Ajf , 4 X) , ,0,0,0,0,0,0,0,0,0 



node © 
T 



node * 



(5) 
(6) 



node O node © node © 



node 



node 1 



node © node © node © 



The variable nodes in Fig. 2(a) are labeled from 1 to 8. 
We also label the solid edges from the variable nodes by 
1, 2, . . . , 22 one by one from left to right. Also denote the 
outgoing values from the variable nodes to the satisfied check 
nodes along these edges by Xi, for example, xi,X2,x 3 leave 
variable node O, X4,x$,xs variable node ©, etc. Collect the 
Xi in the length- 2 2 column vector x, which is the vector of 
outgoing variable edge values in the absorption set. Likewise, 
and analogously, let y be the incoming edge values to the vari- 
able nodes, such that yj corresponds to the reverse-direction 
message on edge j, j = 1, 2, . . . , 22. Now, at iteration i = 0, 



x = A, 



(4) 



where the channel intrinsics vector A is defined in It 
undergoes the following operation at the check node: 



y = Cx = CA, 



(7) 



where C is a permutation matrix that reflects the incoming 
messages back to the absorption set. By induction, we obtain 
at iteration i — I: 



i=0 



(VC) l A + (VC) 7 - 4 Af x) ) , 



(8) 



where is the extrinsics vector and defined in |6]), and the 
variable node addition matrix V is defined in (|9j. Note that 
(VC)° = I and A[, ex) = 0. 



V = 
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1 1 
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Both C and V are square matrices and share the dimension 



ad v — b, which is the number of the solid edges in Fig. 2(a) 

Now we calculate the maximum eigenvalue /1 
and its corresponding unit-length eigenvector v E 



of VC 
in <[T0]|, 



since they are dominating the magnitude of the power of VC 
in d8l. 



1.7870, v max = 
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(10) 



The large value of fi max also underscores the dominance of 
this (8, 2) absorption set over others listed in Table [i] 

Following (9), (Tl], (T3), the (8, 2) absorption set falls in 
error if 

/9 = xJ-l<0. (11) 

For simplicity, we separate this expression into two parts, 
namely /3i and fa, as defined in ( fT2] >. 

/ / T 

^ = ^[(VC) i A] T -l + ^[(VC) / - i Af ) ] 1. (12) 



(9) Let 



(13) 



A = I> 

B = wio + vu + W12 + W13 (14) 
C = («i + v 2 + v 3 ) 2 + (v 4 + v 5 + v 6 f + Or + v$ + v 9 ) 2 
+ (vio + vn) 2 + (vi2 + vi 3 ) 2 + (vu + wis + v 16 ) 2 
+ (vn + v ls + v 19 ) 2 + (v 20 + v 2 i + V22) 2 (15) 

(16) 



D = («io + vn) 2 + (vu + v 13 ) 2 
and apply the spectral theorem, 

(VC)* A K„ ax (A T v max ) 



(17) 
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the means and the variances of fix and /?2, individually, 
simplify to (JT^I-gl). 



TO /3i = A 2 E ~!~ mx 

i=0 ^max 

\ i=0 rtnax/ 
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So, the probability of an absorption set falling in error is 
given by: 



I (i) 

m A(-> 



(22) 




^E^+ B E 



* 2Cm A + 2 ^Et 

\^ \i=0 ^max/ i=1 Mmax 



■ (23) 



As illustrated, the factors A,B,C and £> are determined 
by the absorption set topology. Knowledge of (a, 6) and d v 
is encoded into them. For the Tanner code, B or D are 
much smaller than A or C, compared to the case of the 
dominant absorption set of the IEEE 802. 3an code |9), fTT) . 
This implies that the critical extrinsic information has less 
impact on the error rate of the absorption set, which makes it 
more troublesome than the (8, 8) absorption set of the IEEE 
802. 3an code. 

Exchanging extrinsic values at the black check nodes in 
Fig. |2(a)| is only a first order approximation of the actual 
messages returned, and the effect of the remaining d c — 2 inputs 
to these check nodes can be accounted for with an average, 



iteration-depended gain factor which was computed in fTT) as 



91 =E 



d c -2 



n * ann 



i=l 



(24) 



where m represents the mean of the signals /i' cx ) from 
the variable to the check nodes, and is computed using density 
evolution. With this refinement the probability in ( |2"3") is 
refined to ((25). (Note that go is set to 1.) This check node 
gain quickly grows to 1 after a few iterations, which implies 
that the external variable nodes have assumed their corrected 
values with correspondingly large LLR values. 

2) IEEE 802.3an [2048,1723], (6,32) LDPC Code: The 
IEEE 802. 3an LDPC code's dominant absorption set is shown 
in Fig. [3] and has eigenvalue and eigenvector given by (TTJ: 



/^max 
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[1,1,..., 1J 40 x X 



(27) 
(28) 



The symmetry is also reflected in the following coefficients. 
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(29) 



D. 



(30) 



Substituting ((29) and (|30) into ((25), the probability of the 
(8, 8) absorption set falling in error is computed as ( |26) , which 
is equivalent to setting A = C = a = 8 and B = D = b = 8. 
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i=0 \r max 1=0 91 J i=l V Mmax /=! 91 , 



2C7m A (V(— TT-) ) +2DJ2 



a 21 

rmax 



i=0 V^max ;=0 9lJ i=1 \ Mmax J=1 ffJ y 



f i 



max l=Q 9l / 



E 



(0 

/max 



(25) 



(26) 
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C. Error Probability Formula Refinement 

Besides the relative simplicity of ( |2"3") and ( |25) , one of the 
important insights we gain is that /i max and v max play a crucial 
role in the decoding failure mechanism. Absorption sets with 
large fj, max have more impact on the code performance, since 
they accelerate the growth of erroneous LLRs within the set. 
The error formula can be strengthened, and in this section, we 
revisit its derivation to obtain a more accurate formula. 

1) Pas-' We interpreted the probability of ( [TTj ) happening 
as constituting the probability that an absorption set falls in 
error in (5J, fTT| . This condition can be strengthened because 
at the last iteration /, all elements of x/ are negative by virtue 
of ([TO). The failure probability is more accurately defined as 



P AS = Pr(x, < 0). 



(31) 



In addition, all elements of xj are linear combinations 
of Ai , A 2 , . . . , A Q and A^ , A^ 2 x) , . . . , , where i = 
1,2, ... ,1. Therefore, ( |3"T) is equivalent to saying that 



Pas = Pr ( max {xi j) < 



(32) 



In other words, if the message with the maximum value is 
negative at the 7-th iteration, then all the other variable nodes 
have negative LLRs as well. 

Analogously, we rewrite ([8) into two parts (3 1 and (3 2 , but 
as vectors this time. 



P = x f = ^(VC) J A + ^(VC) / - l A (exi 

i=0 i=0 

Taking the mean of (3 1 and f3 2 we obtain 
m Pl = m^(VC) l l, 



(33) 



i-i 



(34) 



(35) 



for the 
and 



absorption set of the IEEE 802. 3an LDPC code, 



7-1 



m /3 2 = '• 



(vcy[o,...,o, i,i, i,i, o,...,o] r 

node © node © 



(36) 

as mean of f3 2 for the (8, 2) absorption set of the Tanner 
[155, 64, 20] code. 

It can be seen that the maximum entry of v max is dominat- 
ing the elements of m ( g i + mp o . 

In the case of the (8, 2) absorption set of the Tanner code, 
vq or t>i5 are the maximum entries of v max , as can be seen 
in ( fTO) . Thus ([32) becomes 



(37) 



Pas = Pr max(x /j ) < = Pr {x Ifi < 0) . 



And our refined error probability formula can be derived 
accordingly, using the same technique developed in Section 
|III-B| It will retain the form of ( |25) , but is more accurate. 



For the IEEE 802. 3an LDPC code, all v max entries have 
equal size, as shown in (j28]». Selecting any vj for ( [32) will 
result in ( |26) obtained using ([TTJ) and ( |22) . Therefore, this 
modification will not affect the result for the IEEE 802. 3an 
code in (9), |[TTJ, and 



Pr (xj ■ 1 < 0) = Pr ( max(x Itj ) < 



(38) 



2) Spectral Approximation: In ( |25) and ( |26) , we use the 
approximation 



(VC)*A « Knax (A T V„ 



1,2,...,/, (39) 



to estimate the power of VC. This is not very accurate 
when i is small. The results from the early iterations are 
of great importance. So we will drop this approximation and 
use the matrix formulation ([34)-([36) when plotting error rate 
estimates. 

3) Numerical Results: The refined formula predicting the 
error floor of the Tanner [155, 64, 20], (3, 5) code is plotted in 
Fig. [6] as solid curves. The circles are numerical results from 
importance sampling (IS) to support our algebraic analysis. 
Error rates for the IEEE 802. 3an code are shown in Fig. [5] 

IV. Error Floor Reduction 



The error floor formula ( |25) can be used to look for 
contributing factors that come into play in the ultra-low BER 
regime. 

The absorption set structure determines the magnitude of 
/j max , which affects how fast the set LLRs grow. The coeffi- 
cients A, B, C and D depend on the code structure, as shown 
in (fT3)-([T6) and |29)-([30), and the topology of the absorption 
sets. The mean of the intrinsics m\ dependents soley on the 
channel LLR and has no impact on the error floor. 

It is common understanding that simply increasing the 
maximum number of iterations will not solve the trapping set 
problem. This is because the absorption set will stabilize after 
a few iterations as soon as the clipping levels are reached 
by the growing LLR messages. This convergence to a bit- 
flipping operation is exemplified in Fig. |4] From Note [JJ now, 
the absorption set will remain in error even for I — > oo. 

The decisive variable in the error floor formula is m A („ X ), 
the mean of the signals injected into the absorption set through 



its unsatisfied check nodes, shown in Fig. 2(a) and Fig. 3(a) 



It is evident from d25) that the argument of the Q-function can 

fir — d) 
grow only if m^ (ex) can outpace /i rnax as i — > oo. Therefore, 

clipping of m A ( C x) has a major impact on the error probability. 

By density evolution the extrinsics represented by rn x ^ x) 

will grow to infinity, however, using 



2 tanh 



IT tanh 



in the decoding algorithm effectively clips \\ due to numer- 
ical limitations of computing the inverse hyperbolic tangent 
of a number close to unity. This is observed and discussed 
in p4| as saturation. Therefore, instead of ( |40) , we use the 



(40) 
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CD 
O 



LLR clipping 




(2048,1723) LDPC: BER vs SNR 



Fig. 4. The accumulated LLRs at an (8, 8) absorption set nodes of the IEEE 
802. 3an LDPC code versus iterations, assuming all-zero codeword transmitted. 



corrected min-sum algorithm |26|, which computes the check- 
to-variable LLR as 



A 



mm 

lev. 



iev 3 \{i} 



(41) 

where CT represents a correcting term dependent on the check 
node degree, 



CT = 



ln(rf c -l) 



0, 



if mm ie ^\ {i} 
otherwise, 



> 



3 ln(d c 



(42) 

to avoid the tanh(-) calculation. 

Fig. [5] shows the error rates of the IEEE 802. 3an LDPC 
code in its error floor region. The dashed curves plot 



for I 



(i) 

10 and m\ and m x ( ox) are bounded by 10, which is 
the clipping threshold. The circles are the numerical results of 
importance sampling (IS) utilizing ( |4"Tj ) with the same / and 
LLR clipping threshold. When the threshold is increased to 
100, the error rate decreases as |26) predicts, also shown in 
Fig. [5] and supported by IS simulation. The error rate further 
decreases with the even bigger LLR clipping value of 1,000. 

Despite the high sensitivity of the formula to numerical 
issues, IS simulations and analytical results agree to within 
a few tenths of a dB over the entire range of E^/No values 
of interest. 

Regarding the short code, Fig. [6] shows the error rates 
of the Tanner [155, 64, 20] LDPC code. Once again, with 
higher LLR clipping value, ( |25] l predicts that the error rate 
will decrease accordingly. This is supported by IS when the 
clipping threshold is raised from 10 to 100. However, the IS 
results of LLR clipped at 1,000 are not as low as suggested by 
( |2"5| >, see the black stars in Fig. [6] This is due to the short length 
of this code. Shortly after the decoding procedure begins, the 
LLRs will become so correlated that the extrinsics A[ ox ' ) start 
to depart from the behavior predicted by density evolution. 
However, eventually, the absorption set is still corrected given 
more iterations are permitted [13]. The qualitative observations 
made above are therefore valid also for short codes, even 
though the quantitative prediction power of the error formulas 



10 
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io~ : 
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Fig. 5. Bit eiTor rates of the IEEE 802. 3an LDPC code using both formula 
{26} and importance sampling (IS) with LLR clippings at 10, 100 and 1000, 
respectively. The iteration number is set to 10. 



(155,64) Tanner LDPC: BER vs SNR 




10 



Clipping=100 
*IS@Clipping1000 



6.5 



7.5 



8.5 
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Fig. 6. Bit eiTor rates of the Tanner [155, 64, 20] LDPC code using both 
formula (25) and importance sampling (IS) with LLR clippings at 10, 100 
and 1000, respectively. The maximum iteration number is set to 50. 



breaks down. 

V. Iterations and Complexity 

Larger LLR clipping thresholds imply more complexity in 
practice, since wider bit widths are needed to represent the 
messages. Fig. [5] shows how much gain can be achieved 
by increasing the clipping values, in terms of lowering the 
error rate of the IEEE 802. 3an LDPC code. We note that the 
theoretical results correlate well with the IS simulations. 
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Regarding the Tanner [155,64,20] code, simply increasing 
the clipping threshold to over 100 alone has no additional 
benefit, as shown in Fig. [6] The correlation among the LLRs 
of this small code compromise the growth of the extrinsic 
information entering the absorption sets. 

Motivated by these observations, in this section we explore 
the impact of message quantization and number of iterations 
on a code's error floor. Arguably, the product of bit-width and 
number of iterations is an accurate measure of the computa- 
tional complexity of a decoder, since this number is directly 



related to the switching activity of a digital decoder (see 1 27 1), 
and hence also the energy expended by the decoder. 

In order to verify our theoretical results, we resort again 
to importance sampling (IS). IS is an ideal tool to explore 
variations of a decoder, such as finite precision operation, 
where the impact of design changes need to be explored for 
ultra-low error rates. Given the sensitivity of IS, we briefly 
discuss some major points here and relate our own experiences 
with IS applied to LDPC decoding. 

Importance sampling is a method where one increases 
the number of significant events in low event-rate testing 
environments such as bit error rate testing or simulation. The 
basic principle of importance sampling is rooted in Monte- 
Carlo sampling. Specifically, here we wish to evaluate the 
probability that noise carries an original signal xq into the 
decision region of another signal, thus causing a decoding 
error. If N s noise samples are selected according to the 
channel's noise distribution, an estimate of this error can be 
obtained as 



i—1 



(43) 



which is an unbiased estimate of the true error probability 



P= p{y\x ) dy, 
Jv 



(44) 



where the weighting index is simply the error indicator 



V is the signal space region where the decoder fails to 
produce the correct output xo, and p(y\xo) is the conditional 
probability density function of the received signal y given the 
transmitted signal xq, which is related to the noise distribution, 
for example, the Gaussian noise density in our case. 

Unfortunately, for low values of P, one has to generate 
on the order of 10/P samples to obtain statistically relevant 
numbers of error events. For instance, the error floor of the 
IEEE 802. 3an LDPC code appears below a bit error rate of 
10 -10 , as shown in Fig. [5] which requires 10 11 to 10 12 samples 
to be simulated and tested, and with lowering the error floor 
the number of test samples increases further. 

One way of increasing the number of error events, or 
positive counts, is to distort the noise distribution to cause 
more errors. This is typically done by shifting the mean of 
the noise towards a convenient boundary of T) (mean-shift 
importance sampling). The key questions are where to shift 
the transmitted signal and by how much. 



A priori knowledge of the dominant error mechanisms is 
extremely important for proper use of IS, since otherwise a 
mean shift can actually mask an error by moving the signal 
further away from the dominant error event. Furthermore, the 
correct amount of the shift is also important. If the shift value 
is too small, not enough simulation speed up is achieved, if 
the shift value is too large, a phenomenon called over-biasing 
causes the IS error estimate to dramatically underestimate the 
true error contribution by the dominant event. This happens 
when the biased simulated samples occur too far away from 
the decision boundary, but inside the error region. These 
samples are weighed with a index that is too small. Not enough 
samples are generated close to the decision boundary from 
where the majority of the actual error contribution originates. 

Since absorption sets are examined as the primary causes 
of the significant events in the error floor region, we add 
such a mean shift, or bias, towards the bits that make up the 
absorption set. Having identified and enumerated the dominant 
absorption sets of an LDPC code, we technically need to 
perform this shift for each absorption set separately, but sym- 
metries can often be exploited in reducing this task. Gaussian 
distributed noise is added to the thus biased codeword xq. As 
a result, the biased received y will have an increased chance 
of causing an error by failing on the favored absorption set. 
Consequently the sample size N s can be significantly reduced, 
which translates into what is called "the gain" of importance 
sampling. Richardson [5] used a version of IS where this 
mean shift assumes a continuous distribution over which the 
simulations are averaged. This method appears to alleviate 
the over-biasing, but also obscures the phenomenon. In our 
approach we carefully choose the mean shift by assuring that 
small changes do not alter the computed error rate. Mean 
shift values used for some of our simulations are marked in 
the figures. We finally wish to note that, unlike sometimes 
implied in the literature, IS can only properly reproduce an 
error floor if the causes of the error floor are sufficiently 
well understood to apply proper biasing. Hence our two-step 
procedure in Section III. 

Taking into account the bias value and the dominant ab- 
sorption sets selected in the operation, the estimation formula 
( |4"3"j ) along with the weighting factor ( |4"5j ) must be adjusted 
accordingly, leading to a weight term w(y) <§; 1. The com- 
bined effect of measuring more significant events and ascribing 
them lower weight will produce the same error rate measure 
in ( |43~| if the shifting is done correctly. Figs. [5] and [6] show the 
results of importance sampling for the IEEE 802. 3an and the 
Tanner [155, 64, 20] LDPC codes compared to our formulas 
for foating point calculations. 

In a hardware implementation, however, finite precision 
arithmetic is used. The more digits used in the decoder, the 
more power and computational effort is required, but better 
performance will results. It is therefore vitally important for 
implementations to understand this cost-benefit tradeoff. For 
the IEEE 802. 3an code, IS simulations with both floating and 
fixed point calculations at different LLR clipping values are 
shown in Fig. [7] Not surprisingly, for smaller clipping values, 
smaller numbers of bits are required to adequately represent 
the messages. While 6 bits of quantization are required for 
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Fig. 7. Error rates of the IEEE 802. 3an LDPC code via importance sampling 
(IS) with finite precision, where the LLRs are clipped at 10, 100 and 1000, 
respectively. The maximum iteration number is preset at 10. 

a clipping threshold of 10, 10 bits are needed for a clipping 
threshold of 100, and 14 bits of quantization are required to 
exploit the full benefit of a clipping threshold of 1000. 

VI. Conclusion 

We revisited the error floor of LDPC codes and showed how 
the level of this floor can be controlled in a very large range by 
a proper choice of message representation inside the iterative 
message-passing decoder. In particular, the maximum allowed 
value of these messages, the clipping threshold, as well as the 
resolution of finite-precision arithmetic, have a key impact on 
the level of the floor. For short codes, additionally, very large 
iteration numbers may be required. 

We verified our findings and error floor control mechanisms 
by extending our recent linearized analysis theory, and numer- 
ically verifying the results with importance sampling of the 
dominant absorption sets for two representative LDPC codes, 
namely the Tanner [155,64,20] regular (3,5) code and the 
larger IEEE 802.3an [2048, 1723] regular (6, 32) code. We 
conclude that the error floor can be lowered by many orders of 
magnitude, or made to virtually disappear, with proper setting 
of the message parameters, which allows the extrinsic signals 
to outpace the error growth inside the absorption sets. 
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